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SPECIFICATION 

COMPUTER SYSTEM, SERVERS CONST ITUTINGTHE SAME , AMD JOB EXECUTION 
CONTROL METHOD AND PROGRAM THEREFOR 

TECHNICAL FIELD 

Thepresentinvefi.tionrelatest.ogri dsorapnt ingand, inparticul&r, 
to a scheduling method for assigning jobs to computers and a system 
configuration that implements the Method. 

BACKGROUND ART 

tern ca N n ing has attr? • ' s atts ion 

in recent years that integrates and uses heterogeneous information 
systems interconnected through a network. In this system, computer 
resources such as CPU power and data storage of multiple computers 
on the network are shared so that the system is used as a single, 
virtuaXhigh-perf ormance computer . Bycausingthemultiple computers 
to perform parallel processing, a large amount of processing can 
be performed at high speeds even though the performances of the 
individual computers are low. 

In grid computing, scheduling which determines the execution 
sequence of jobs (units of processing in a program) provided to the 
system according to the characteristics and priorities of the jobs 
is est ta becaus a coranent which 

is inherently heterogeneous, jobs must be assigned to multiple 

users must be properly scheduled. 

into two types: push and pull. In the push type scheduling {see 
Non-patent document 1 for example) , when a job is submitted to a 
scheduler, the scheduler assigns a computer to execute the job and 
requests t e te t - - tet ~ A . Fc oit ft t - , - - eduiei 

monitors the status or the computers and assigns the job to the most 
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suitable computer according to load information of the job, 

rhe push t dull L f ten used in a gr te 

calied a cluster grid that consists of several hundred servers 
{computers} provided at the- same site. The push type scheduling 
scheme can assign a job specifically to the roost suitable computer 
and therefore can provide optimum scheduling. Especially, in an 
environment where there are variations in behavior of computers and 
a job can be canceled on a computer when a. user uses the computer, 

of the system. 

on the other hand, each computer polls a scheduler to request a job 
when the computer becomes ready to execute a job. If there is a 
job to execute when the request is made, the scheduler assigns the 
job to the computer end the computer executes the assigned job. If 
there is no job to execute, the computer polls again after a lapse 
of predetermined time. 

The pull type scheduling scheme can be implemented with a very 
simple configuration and, therefore, it is often used in grid systems 
consisting of several thousand computers. Especially, a grid built 
on the Internet uses this put scheduling scheme because of constraints 
of the network. Furthermore, the pull type scheduling scheme can 
be readily applied to a large number of computers because management 
information about the computers is simple. Self -optimization by 
polling (that is, computers with higher availability per form polling 
'[ a 1 nff- <- etf - «- r to some extent. 

Non-patent document 1 : Chris Smith, M Open Source Metascheduling 
for Virtual Organizations with the Community Scheduler Framework 



f bb, Matt d , 3 ) tr ibut v:pf SET I", 

omputing in Science S Engineering, Vol. 3, Issue i, Jan. -Feb. 2001, 
ages 78-83 



DISCLOSURE OF THE INVENTION 
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PROBLEMS TO BE SOLVED BY THE INVENTION 

A3 described above, the conventional push type scheduling scheme 
in grid computing enables optimum job assignment. However, in order 
to accomplish optimum scheduling, the utilisation of each computer 
nm s t b e m a n a :? e d . Ac c o r d i n g i y , o p «i r a I i o n s o f s c h e d u 1 e r a t: e c omp 1 i c a t e d 
md changes a f atio s " - a s -j - on c rernova 

Furthermore, the push type scheduling cannot be applied to computers 
inside a firewall because the scheduler accesses each computer to 
request it to execute a job, 

accommodate changes to a system configuration because computer 
management information required by the scheduler is simple. 
Furthermore, jobs can. be assigned to computers inside a firewall 
because access on the networkiaperformedbypoiiingfromt he computers . 
Although the pull type scheduling scheme enables self-optimization 
by the polling, it. causes a time loss. That is, even when there 
is a job to execute, a request for execution of the job is issued 
o n .1 y a f t e r p o .1 1 i n g f r cm a c omp u tor. M o r ••: o v er, whena computer perf o r m s 
polling, a ob that is op 

to the computer among jobs to be executed, which may not result in 
the most efficient scheduling for t icvcho 1 ^ - ^ iso, lorona:ng 

on the type of the job, there can be more suitable computers. 

Therefore an object of the present invention is to provide a 
system and a jot execution cornier n~* n rare.- e c* s ; ■>* r j 

status of computers as well as the types of jobs, and readily 
accommodating changes to a system configuration. 

job i ~ - - •"• r type sch i 

a grid including computers access to which is controlled through 
a firewall. 

Yet another object of the present invention is to provide a 
d that is a comb i i e sche Lulls „ n * 

- 3 - 
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pull type scheduling, and to provide a system using the method. 

MEANS FOR SOLVING THE PROBLEMS 

In order to achieve the objects, the present invention provides 
a computer system for performing grid computing by using multiple 

omputersin e ne l± Lgu i >1I 

The computer system includes a center server which is a computer 
requesting computers on the network to execute a job, and process 
servers each of which is a computer executing a job in response to 
a request fromthe center server . Thecente- * - * •rs&duie v " 

issues a job execution request, and an agent section which manages 
i tio t e 5 w t/ ww„t„ jj 

by the scheduler section, and sends the request to the process server 
to which the requested job has been assigned, depending on the status 
of the process server. 

11 jl 1 , t I Kit ) v I 1 i 3 1 v t 

with the multiple process servers in one-to-one correspondence. 

Preferably, theagent sect ionob tains frcfmacorrespondingprocess 
server and manages information about the capacity and operating status 

PI X 1 t f- (r- ^1" J - 1 J ~ <r 1^13 1 q \ 

a lob to the process server on the basis of the information managed 
by the agent section. 

The agent section sends a request received from the scheduler 
section in response to a polling access from the process server, 
or sends a request received from the scheduler section at timing 
managed by the agent section, depending on an access type of the 



i L U it v 

across the firewall, the agent section waits for a polling access 
from the process server before sending a request. 

In another aspect of the present invention that achieves the 
objects described above, a job execution control method is provided 
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it ion of . t iox >nt ro L 

method includes the steps of assigning a lob to a process server 
constituting a system and e a - - ing a job, on the basis of the capacity 
of the process server stored in a storage regardless of the operating 
status of the process server, issuing a job execution request to 
the process server to which the job is assigned, and holding temporarily 
the issued job execution request and sending the job execution request 
to the process server to which the job is assigned, depending on 
the operating status of the process server. 

The present invention further provides a programfor controlling 
a computer to implement the functions of the center server described 

to the steps of the job control method described above. The program 
maybe distributed with a magnetic disk, optical disk, semiconductor 
memory, or other recording medium in which the program is stored, 
or distributed through a network. 



ADVANTAGES OF THE INVENTION 



According to the present in vent ion configured a s de scr ibed above , 
an agent section that relays communication between a scheduler and 
a process server in grid computing is provided on a center server 
nsnt Acucn performs contro.1 to c commodate a difference 
in access types among process servers, thereby enabling optimum 
scheduling in terms of the types and utilisation status of computers 
as well as the types of jobs regardless of the difference in access 
types among the process servers. Furthermore, because the agent 

can be readily modified by adding or removing a process server or 

According to the present invention, because the agent section 
performs control to accommodate a difference in the access types 
among the process servers, a process server capable of directly 
ec ig a joS t i 3 t t igh the push ty| scheduling 

may coexist wits a process sever re reiving a job execution request 
after polling through the pull type scheduling, in the system, and 
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optimum job assignment equivalent to that of the push type scheduling 
can be accomplished. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a general configuration of a grid computing system 
according to an embodiment ; 

, f " t , ~ J 1 ! Jf t 

lit ' ji o >. process server in 

the grid computing cystccn according the 1 n ' * ; 

Figure 3 shows a functional configuration of a center server 
according to the embodiment; 

of a process server that performs polling =si i on 

according to the embodiment ; 

Figure 5 shows a relationship between a functional configuration 
of a process server that does not perform polling and a PS agent 
section according to the invention; and 

Figure 6 is a flowchart illustrating an operation performed 
by a PS agent section when a job execution request is issued according 

* t K t t 

BEST MODE FOR CARRYING OUT THE INVENTION 

The be st mode for carrying out the present invention (hereinafter 
referred to as the embodiment) will be described in detail, below 
with ef - % accompanying drawings . 

Figure 1 iowsage ral fx rati ii rid mpufii 1 
according to the present embodiment. 

As shown in Figure 1 , the systeiaof the pre sent embodiment includes 
a center server { C3 } 100 widen assigns jobs and process servers (PS) 
200 which actually execute jobs assigned by the center server 100. 
The center server 100 and the process severs 200 are interconnected 
through the Internet or other computer network. The computer network 
t e- k - - ^ \s t c 

protocol, and may have a firewall or any other access control. 

- 6 ~ 
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il » jM lie 

suitable for implementing the center server 100 and the process server 
200 in the grid computing system according to the present embodiment . 

The computer shown in Figure 2 includes a CPU (Central Processing 
On it) 11 which is processing means, a main memory 13 connected to 
the CPU 11 through a M/B (mother board) chip set 12 and a CPU bus, 
a video card 14 also connected to the CPU 11 through the M/B chip 
set 12 and an AGP (Accelerated Graphics Fort) , a magnetic disk device 

a low-speed bus sued as an ISA (Industry Standard. Architecture} fans. 

It should be noted that the hardware configuration of the computer 
that imp leme nt s the pr e sen t embed i tne n t s how n. i n Pi gu r e 2 i s i 1 lu s t r a t i ve 
only and any other configurations may be used as long as the present 
embodiment can be applied thereto. For example , instead of the video 
card 14, a video memory alone may be provided and image data may 
be processed in the CPU 11. An external storage device such as a 
CD-R {Compact Disc Recordable} and DVD-RAM (Digital Versatile Disc 
Random Access Memory) drive may be provided through an interface 
s a o h a s a n ATA ( AT A 1 1 a c hme n t) or SCSI { Sma 11 Comp u terS y s t em I n t e r f a ce ) . 

Figure 3 shows a functional configuration of the center server 
100. The center server 100 includes a scheduler section 110 which 
performs job assignment (scheduling; for each process server 200, 
and a PS agent section 120 which manages a process server 200 and 
la \ ts 1 i t v •> 1 j > N 1 ■)>- process 



c "c o-s > ^ N - - > i. The scheduler 

section 110 accesses each process server 2 00 through the PS agent 

] k , - - .n } 10 nay i m. pi cent ed by the program 

controlled CPU 11 and memory means such as the main memory 13 and 
agnet led " j , for example, and includes, 

as specific f tioi is, > 4 . - 111 , 

i ptimum PS ele ~< ecti 112, re es ing e i i 
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113 as shown in Figure 3. 

to the 

PS agent sect orss ^ „ ^ _ - 

200 about the capacity of the process servers 200 to obtain the 
information . 

Ti c PS sele r i , k~ tio elect n optimal ess 

server COO for a job on the basis of the capacity information of 
the process servers 200 obtained by the PS capacity checking section 
111 and assigns the job to that process server 200. Any optimization 
logic may be used in the job assignment. 

The job requesting section 113 issues a request for job execution 
to a PS agent section 120 corresponding to aproeess server 200 selected 
by the optimum PS selecting section 11.2. 

In the present embodiment, the PS agent section 120 relays 
communication between the scheduler section 110 and the process server 
200 and receives a job execution request from the scheduler section 
110 on behalf of the process server 200. Accordingly, the PS capacity 
checking section 111 makes an inquiry to the PS agent 120 and the 
job requesting section 113 issues a request to the PS agent 120. 
However, the function of the scheduler section 110 is the same as 
that of the existing push type scheduler. Therefore, a scheduler 
^o-o ^- i a H *~ s n-d..^*- 

section 110. 

ThePSagentscct u ] ^ i J i it _ ^ - _ parnrjor^' 1 " ed 

CPU 11 and memory means such as the main memory 13 and the magnetic 
di s k devi ce 15 shown in Figure 2 , for example, and includes, as specific 



1 i i v v 1 „v v id ou.v t dr ig 

< C c i ~ , ^ , ~ , - t " 1 f t 1 

recess server 200, 

The PS capacity managing section 122 manages statistical 
f, -> a t d nee the job exe t a acit < i "^e o-o-^s- 

erver 200 and, in response to an inquiry from the PS capacity checking 
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section ill of the scheduler section 110, returns information managed 

v. I iv - - [ ) I 1 1 1 1 „ v t 

concerning the lob execution capacity includes static information 
about the throughput of a CPU and the storage capacity of a storage 
device as we'll as information obtained by statistically processing 
dynamic information such as changes of load on the CPU with time 
and the operation pattern of the CPU. 

Information managed by the FS status managing section 121 and 
the PS capacity managing section 122 is obtained from the process 

t jt j to the PS agent sect t< ^ f 

in T o - . j ^ < " "o ~ i - disk 

device 15 shown in Figure 2. 

The job receiving section 123 receives a job execution request 
issued by the nq section 13 of the scheduler section 

110. 

The job requesting section 124 transfers a job execution request 
received at the job receiving section 123 to the corresponding process 
server 200. 

The polling waiting section 125 receives notification that the 
process server 200 is ready to execute a job, from the process server 
200 through polling. 

i ji> ep.t e e vliment , whiii pu p* file 
is used, a polling access from a process server 200 can be accepted 
and a job execution request can be sent in response to the polling 
access, which will be detailed later. The polling waiting section 
125 in the PS agent section 120 is used for receiving such polling 
and, therefore, is not a necessary component for the E igent section 
120 for a p - r re ei in <, \ it io . f oiling, 

a job execution request sent from the center server 100 at. desired 
timing in the center server 100. 

A relationship between a functional configuration of a process 
server 200 and a corresponding PS agent section 120 will be described 
below. 

As mentioned stove, some of the p roses s servers 200 assumed 
i ~ e{ - - o - T - - f ' a- - h-i ?. is not perform 

polling . 
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L _ ^ <■* is- „ iv't-^L f , - r _ , ^ _ , i L „ u 1 t L L < i 

of a process ser ->r 2: t - 1 per for s polling and z rrespo ling 
PS agent section 120. 

The process? sever 2 i bed - which 

allows a computer as shown in Fi.gc.rc 2 no function as a process server 
200 in the grid competing system. 

The PS embedded section 210 Bay be implemented by the program 
controlled CPU 11 and storage means such as the main memory 13 or 
thexaagne tic disk device 15 shown in Figure 2, for example , and includes , 

Figure 4. 

The PS status monitoring section 211 monitors the current, 
utilisation status of the process server 200 and status of resources 
to collect information. 

The PS status notifying section 212 provides information about 
the utilization status of the processing server and the status of 
the resources collected by the PS status monitoring section 211 to 
the PS agent section 120 of the center server 100, The PS status 
managing section 121 and the PS capacity managing section 122 in 
i \ 1 0 re^ei <~ e o ! 1 > re 1 in 

a storage device such as the main memory 13 or magnetic disk device 
J * i I lis . Notification from the PS status notifying 
section212 totrm icm r.cni. C> .Ov- v dmcgj ninlcncis 

or may be provided when the operating status cf the prow-'- j >rv< \ 
200 has changed. Alternatively, the corresponding PS agent section 
120 on the center server 100 may make an inquiry to the process server 

The job receiving section £13 receives a job execution request 
sent from the job requesting section 124 of the PS agent section 
120 of the center server 100. 

The job executing section 214 executes a job received at the 
ire i s o- *d t j t 

200. 

The polling section 215 notifies the PS agent section 120 on 
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the center server 100 that the process server £00 is ready to execute 
olli ti 0 j cr 35 ebasi f information 

nonit< edi t eE stat s nit isect r m l ej < raiting 

section 125 in the PS agent section 12 0 receives the notification 
from the polling section 215 and causes the job requesting section 
124 to send a job execution request. 

It should be noted that the polling section 215 can be omitted 
fros« a process server 200 if the server is capable of receiving a 
jobexecutionrequestsentrrc t ^.^,Ev^.i„vL sc i hoi nning 
polling, that is, if the process server 200 is not placed inside 
a firewall or otherwise access controlled. 

Figure 1 show i * „ Ov. i i t ~< i t f 'in 

of a process server 200 that does not perform polling and a PS agent 
section 120. 

Operations of the grid computing system configured as described 
above according to the present embodiment will be described below. 

As mentioned above, the scheduler section 110 of the center 
server 100 is the same as the esistingpush type scheduler . Therefore, 
when a job to be executed occurs, the scheduler section 110 tries 
to assign the job to any of the process servers 200 being managed 
by the center server 100. The scheduler sect! on llOobtains statistical 
information about the capacities andoperat ion pa item s of the process 
servers 200 from the PS agent sections 120 and performs optimum 
scheduling on the basis of the information and the type and 
characteristics of the job. The scheduler section 110 issues a job 
execution request regardless of the operating status of the process 
server 200 to which the job has been assigned, and sends it to the 
PS agent section 120 corresponding to the process server 200 to request 
it to execute the job. 

i ca ' 1 1 * ore v _ % " I _ •„> ■„ i le of receiving 
a job execution request directly from the center server 100, or the 
one receiving a job execution request af ter pol ling because the process 
server 200 is placed inside a firewall. 

F j ire c s a 4 l r art st rating an operation of a PS agent 

section 120 when it receives a job execution request. 
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As shown in Figure 6, the ?3 agent section 120 receives a job 
execution request from the scheduler section 110 through the job 
receix ngse t < stepSGl) . Ifapro ess - re 0 s-r.tespo ding 

to the PS agent section 120 is of the typo capable or directly receiving 
a job execution request, the E'S agent section 120 imreiayly sends 
the job execution request to the process server 200 {steps 602 and 
604} , 

On the other hand, if the process server 200 corresponding to 
the PS agent section 120 is of the type that performs polling to 

server 200 (step 604} . If the PS agent section 120 does not have 

So ex< - c r » 3 - - t * t s \ i i 1 s x t< - ss - - polls 

the PS agent section 120 performs no action and waits until a job 
and next polling are received. 

If the process server 200 is of the type capable of directly 
receiving a job execution request from the center server 100, the 
process server 200 receives the job execution request, executes the 
job according to the request, and then returns the result of the 
execution to the PS agent section 120 of the center server 100. 

On the other ; a; 0, if the process server 200 is of the type 
that performs polling before receiving a job execution request, the 
process server 200 performs polling -when it becomes ready to execute 
a job. Then the process server 200 waits until a job execution request 
is sent, h?hen it receives a job execution request sent from a 
corresponding PS agent section 120 on the center server 100, the 



a job execution request, rt repeats porting alter a lapse or 
predetermined time. 

The operation of the process server 200 described above is the 
same as that of a process server in a conventional grid computing 
system. However, in the present embodiment, each of the PS agent 
sections 120 on the center server 100 corresponding to the respective 
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process servers 2 enc v ether to £ i t i requesl 

to a to 
wait for polling access from the process server 200 before sending 
a job execution request, according to the access type of the process 
server 200, as described above. This means that, the difference in 
the access types among the process servers 200 can be accommodated 
by the control by the PS agent sections 120. Therefore, a process 
server 200 capable of receiving a job execution request directly 
from the center server 100 can coexist with a process server 200 

t ^_Lv.s_l tv I 111 i\ Lv tl\I Uj 

a lob execution request in the system according to the present 

As described above, the configuration according to the present 
embodiment can include a process server 200 that performs polling 
before receiving a job execution request. Even in this case, the 
scheduler section 110 can perform optimum scheduling based on the 
capacity of theprocess server 200 as wellas the typeandcharacteristics 
of a job regardless of the operating status of the process server 
200 because the scheduler section 110 issues the job execution request 
to the process server 200 through the PS agent sections 120. 

Furthermore, according to the present embodiment, the PS agent 
sections 120, each of which corresponds to one process server 200, 
are provided on the center server 100, and the PS agent sections 
120 manage information about the corresponding process servers 200 
and control transmission and reception of requests and responses 
as described above, Furthermore, the scheduler section 110 assigns 

by j Kh „ v v j.^ tv t the 

press server. 
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